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<210> 1 
<211> 2562 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<220> 

<223> full length human immunodeficiency virus -1 (HIV-1) 
envelope glycoprotein (Env, gpl60) from primary 
R5X4 isolate 89.6 

<400> 1 

atgagagtga aggagatcag gaagaattgg cagcacttga gagggggcat cttgctcctt 60 
gggatgttga tgatctgtag tgctgcaaaa gaaaaaacgt gggtcacaat ctattatggg 12 0 
gtacctgtgt ggagagaagc aaccaccact ctattttgtg catcagatgc taaagcctat 180 
gatacagagg tacataatgt ttgggccaca catgcctgtg tacccacaga ccccaaccca 240 
caagaagtag tattgggaaa tgtgacagaa aattttaaca tgtggaaaaa taacatggta 300 
gatcagatgc atgaggatat aatcagttta tgggatgaaa gcctaaagcc atgtgtaaaa 360 
ttaaccccac tctgtgttac tttaaattgc actaatttga atatcactaa gaatactact 420 
aatcccacta gtagcagctg gggaatgatg gagaaaggag aaataaaaaa ttgctctttc 4 80 
tatatcacca caagcataag aaataaggta aagaaagaat atgcactttt taatagactt 540 
gatgtagtac caatagaaaa tactaataat actaagtata ggttaataag ttgtaacacc 600 
tcagtcatta cacaggcctg tccaaaggta tcctttcagc caattcccat acattattgt 660 
gtcccggctg ggtttgcgat gctaaagtgt aacaataaga cattcaatgg atcaggacca 72 0 
tgcacaaatg tcagcacagt acaatgtaca catggaatta ggccagtggt gtcaactcaa 780 
ctgctgttaa atggcagtct agcagaagaa gacatagtaa ttagatctga aaatttcaca 84 0 
gacaatgcta aaaccataat agtacagcta aatgaatctg tagtaattaa ttgtacaaga 900 
cccaacaaca atacaagaag aaggttatct ataggaccag ggagagcatt ttatgcaaga 960 
agaaacataa taggagatat aagacaagca cattgtaaca ttagtagagc aaaatggaat 102 0 
aacactttac aacagatagt tataaaatta agagaaaaat ttaggaataa aacaatagcc 1080 
tttaatcaat cctcaggagg ggacccagaa attgtaatgc acagttttaa ttgtggaggg 1140 
gaatttttct actgtaatac agcacaactg tttaatagta cttggaatgt tactggaggg 12 00 
acaaatggca ctgaaggaaa tgacataatc acactccaat gcagaataaa acaaattata 12 60 
aatatgtggc agaaagtagg aaaagcaatg tatgcccctc ccatcacagg acaaattaga 1320 
tgttcatcaa atattacagg gctgctacta acaagagatg gaggtaatag tactgagact 13 80 
gagactgaga tcttcagacc tggaggagga gatatgaggg acaattggag aagtgaatta 144 0 
tataaatata aagtagtaag aattgaacca ataggagtag cacccaccag ggcaaagaga 1500 



1 



agaacagtgc 
ggagcagcag 
ttattgtctg 
catatgttgc 
gaaagatacc 
tgcaccactt 
aataacatga 
gacttacttg 
gataaatggg 
ttattcataa 
atagtaaata 
tcgaggggac 
agatccggtc 
tgcctcttcc 
cttctgggac 
agccaggaac 
gaggggacag 
cctacaagaa 



aaagagaaaa 
gaagcactat 
gtatagtgca 
aactcacagt 
taagggatca 
ctgtgccttg 
cctggatgga 
aaaaatcgca 
caagtttgtg 
tgatagtagg 
gagttaggca 
ccgacaggcc 
cattagtgaa 
tctaccacct 
gcagggggtg 
taaagaatag 
atagggttat 
tcagacaggg 



aagagcagtg 
gggcgcagcg 
gcagcagaac 
ctggggcatc 
acagctcatg 
gaatgttagt 
gtgggaaaga 
aacccaacaa 
gaattggttt 
aggcttgata 
gggatattca 
cgaaggaaca 
cggattcttg 
cttgagaaac 
ggaagccctc 
tgctgttagc 
aaaaatagta 
cttggaaagg 



ggaataggag 
tcagtgacgc 
aatctgctga 
aagcagctcc 
ggaatttggg 
tggagtaata 
gaaattgaca 
gaaaagaatg 
gacataacaa 
ggtttaagaa 
ccattatcgt 
gaagaagaag 
gcacttttct 
ttactcttga 
aaatattggt 
ttgctcaacg 
caaagagctt 
gctttgctat 



ctgtgttcct 
tgacggtaca 
gggctattga 
aggcaagagt 
gttgctctgg 
aatctgtgga 
attacacaga 
aaaaagaatt 
actggctgtg 
tagtttttgc 
ttcagaccct 
gtggagagag 
gggtcgattt 
ttgtaacgag 
ggaatctcct 
ccacagccat 
gtagagctat 
aa 



tgggttcttg 
ggccaggcta 
ggcgcaacag 
cctggctctg 
aaaactcatt 
tgatatttgg 
ctatatatat 
attggaattg 
gtatataaga 
tgtactttct 
cctcccagcc 
agacagagac 
gaggaacctg 
gattgtggaa 
gcaatattgg 
agcagtagct 
tcgcaacata 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2562 



<210> 2 
<211> 853 
<212> PRT 

<213> Human immunodeficiency virus type 1 
<220> 

<223> full length human immunodeficiency virus-1 (HIV-1) 
envelope glycoprotein (Env, gpl60) from primary 
R5X4 isolate 89.6 

<400> 2 

Met Arg Val Lys Glu lie Arg Lys Asn Trp Gin His Leu Arg Gly Gly 
1 5 10 15 

lie Leu Leu Leu Gly Met Leu Met lie Cys Ser Ala Ala Lys Glu Lys 
20 25 30 

Thr Trp Val Thr lie Tyr Tyr Gly Val Pro Val Trp Arg Glu Ala Thr 
35 40 45 

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
50 55 60 

His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro 
65 70 75 80 

Gin Glu Val Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
85 90 95 

Asn Asn Met Val Asp Gin Met His Glu Asp lie lie Ser Leu Trp Asp 
100 105 110 

Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

Asn Cys Thr Asn Leu Asn lie Thr Lys Asn Thr Thr Asn Pro Thr Ser 
130 135 140 

Ser Ser Trp Gly Met Met Glu Lys Gly Glu lie Lys Asn Cys Ser Phe 
145 150 155 160 



2 



Tyr lie Thr Thr 



Phe Asn Arg Leu 
180 

Tyr Arg Leu lie 
195 

Lys Val Ser Phe 
210 

Phe Ala Met Leu 
225 

Cys Thr Asn Val 



Val Ser Thr Gin 
260 

Val lie Arg Ser 
275 

Gin Leu Asn Glu 
290 

Thr Arg Arg Arg 
305 

Arg Asn lie lie 



Ala Lys Trp Asn 
340 

Lys Phe Arg Asn 
355 

Pro Glu lie Val 
370 

Cys Asn Thr Ala 
385 

Thr Asn Gly Thr 



Lys Gin He He 
420 

Pro Pro He Thr 
435 

Leu Leu Thr Arg 
450 

Phe Arg Pro Gly 
465 




Ser He Arg Asn 
165 

Asp Val Val Pro 



Ser Cys Asn Thr 
200 

Gin Pro He Pro 
215 

Lys Cys Asn Asn 
230 

Ser Thr Val Gin 
245 

Leu Leu Leu Asn 



Glu Asn Phe Thr 
280 

Ser Val Val He 
295 

Leu Ser He Gly 
310 

Gly Asp He Arg 
325 

Asn Thr Leu Gin 



Lys Thr He Ala 
360 

Met His Ser Phe 
375 

Gin Leu Phe Asn 
390 

Glu Gly Asn Asp 
405 

Asn Met Trp Gin 



Gly Gin He Arg 
440 

Asp Gly Gly Asn 
455 

Gly Gly Asp Met 
470 



Lys Val Lys Lys 
170 

He Glu Asn Thr 
185 

Ser Val He Thr 



He His Tyr Cys 
220 

Lys Thr Phe Asn 
235 

Cys Thr His Gly 
250 

Gly Ser Leu Ala 
265 

Asp Asn Ala Lys 



Asn Cys Thr Arg 
300 

Pro Gly Arg Ala 
315 

Gin Ala His Cys 
330 

Gin He Val He 
345 

Phe Asn Gin Ser 



Asn Cys Gly Gly 
380 

Ser Thr Trp Asn 
395 

He He Thr Leu 
410 

Lys Val Gly Lys 
425 

Cys Ser Ser Asn 



Ser Thr Glu Thr 
460 

Arg Asp Asn Trp 
475 




Glu Tyr Ala Leu 
175 

Asn Asn Thr Lys 
190 

Gin Ala Cys Pro 
205 

Val Pro Ala Gly 



Gly Ser Gly Pro 
240 

He Arg Pro Val 
255 

Glu Glu Asp He 
270 

Thr He He Val 
285 

Pro Asn Asn Asn 



Phe Tyr Ala Arg 
320 

Asn He Ser Arg 
335 

Lys Leu Arg Glu 
350 

Ser Gly Gly Asp 
365 

Glu Phe Phe Tyr 



Val Thr Gly Gly 
400 

Gin Cys Arg He 
415 

Ala Met Tyr Ala 
430 

He Thr Gly Leu 
445 

Glu Thr Glu He 



Arg Ser Glu Leu 
480 
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Tyr Lys Tyr Lys Val Val Arg lie Glu Pro lie Gly Val Ala Pro Thr 
485 490 495 

Arg Ala Lys Arg Arg Thr Val Gin Arg Glu Lys Arg. Ala Val Gly lie 
500 505 510 

Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 
515 520 525 

Ala Ala Ser Val Thr Leu Thr Val Gin Ala Arg Leu Leu Leu Ser Gly 
530 535 540 

lie Val Gin Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin 
545 550 555 560 

His Met Leu Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg 
565 570 575 

Val Leu Ala Leu Glu Arg Tyr Leu Arg Asp Gin Gin Leu Met Gly lie 
580 585 590 

Trp Gly Cys Ser Gly Lys Leu lie Cys Thr Thr Ser Val Pro Trp Asn 
595 600 605 

Val Ser Trp Ser Asn Lys Ser Val Asp Asp lie Trp Asn Asn Met Thr 
610 615 620 

Trp Met Glu Trp Glu Arg Glu lie Asp Asn Tyr Thr Asp Tyr lie Tyr 
625 630 635 640 

Asp Leu Leu Glu Lys Ser Gin Thr Gin Gin Glu Lys Asn Glu Lys Glu 
645 650 655 

Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp lie 
660 665 670 

Thr Asn Trp Leu Trp Tyr lie Arg Leu Phe lie Met lie Val Gly Gly 
675 680 685 

Leu lie Gly Leu Arg lie Val Phe Ala Val Leu Ser He Val Asn Arg 
690 695 700 

Val Arg Gin Gly Tyr Ser Pro Leu Ser Phe Gin Thr Leu Leu Pro Ala 
705 710 715 720 

Ser Arg Gly Pro Asp Arg Pro Glu Gly Thr Glu Glu Glu Gly Gly Glu 
725 730 735 

Arg Asp Arg Asp Arg Ser Gly Pro Leu Val Asn Gly Phe Leu Ala Leu 
740 745 750 

Phe Trp Val Asp Leu Arg Asn Leu Cys Leu Phe Leu Tyr His Leu Leu 
755 760 765 

Arg Asn Leu Leu Leu He Val Thr Arg He Val Glu Leu Leu Gly Arg 
770 775 780 

Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu Gin Tyr Trp 
785 790 795 800 
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Ser Gin Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr Ala 

805 810 815 

lie Ala Val Ala Glu Gly Thr Asp Arg Val lie Lys lie Val Gin Arg 
820 825 830 

Ala Cys Arg Ala lie Arg Asn lie Pro Thr Arg lie Arg Gin Gly Leu 
835 840 845 



Glu Arg Ala Leu Leu 
850 



<210> 3 
<211> 2051 
<212> DNA 

<213> Human immunodeficiency virus type 1 
<220> 

<22 3> human immunodeficiency virus -1 (HIV-1) envelope 
glycoprotein gpl40 truncated version of gp 160 
from primary R5X4 isolate 89.6 



<400> 3 

atgagagtga 

gggatgttga 

gtacctgtgt 

gatacagagg 

caagaagtag 

gatcagatgc 

ttaaccccac 

aatcccacta 

tatatcacca 

gatgtagtac 

tcagtcatta 

gtcccggctg 

tgcacaaatg 

ctgctgttaa 

gacaatgcta 

cccaacaaca 

agaaacataa 

aacactttac 

tttaatcaat 

gaatttttct 

acaaatggca 

aatatgtggc 

tgttcatcaa 

gagactgaga 

tataaatata 

agaacagtgc 

ggagcagcag 

ttattgtctg 

catatgttgc 

gaaagatacc 

tgcaccactt 

aataacatga 

gacttacttg 

gataaatggg 

ttattcataa 



aggagatcag 
tgatctgtag 
ggagagaagc 
tacataatgt 
tattgggaaa 
atgaggatat 
tctgtgttac 
gtagcagctg 
caagcataag 
caatagaaaa 
cacaggcctg 
ggtttgcgat 
tcagcacagt 
atggcagtct 
aaaccataat 
atacaagaag 
taggagatat 
aacagatagt 
cctcaggagg 
actgtaatac 
ctgaaggaaa 
agaaagtagg 
atattacagg 
tcttcagacc 
aagtagtaag 
aaagagaaaa 
gaagcactat 
gtatagtgca 
aactcacagt 
taagggatca 
ctgtgccttg 
cctggatgga 
aaaaatcgca 
caagtttgtg 
t 



gaagaattgg 
tgctgcaaaa 
aaccaccact 
ttgggccaca 
tgtgacagaa 
aatcagttta 
tttaaattgc 
gggaatgatg 
aaataaggta 
tactaataat 
tccaaaggta 
gctaaagtgt 
acaatgtaca 
agcagaagaa 
agtacagcta 
aaggttatct 
aagacaagca 
tataaaatta 
ggacccagaa 
agcacaactg 
tgacataatc 
aaaagcaatg 
gctgctacta 
tggaggagga 
aattgaacca 
aagagcagtg 
gggcgcagcg 
gcagcagaac 
ctggggcatc 
acagctcatg 
gaatgttagt 
gtgggaaaga 
aacccaacaa 
gaattggttt 



cagcacttga 
gaaaaaacgt 
ctattttgtg 
catgcctgtg 
aattttaaca 
tgggatgaaa 
actaatttga 
gagaaaggag 
aagaaagaat 
actaagtata 
tcctttcagc 
aacaataaga 
catggaatta 
gacatagtaa 
aatgaatctg 
ataggaccag 
cattgtaaca 
agagaaaaat 
attgtaatgc 
tttaatagta 
acactccaat 
tatgcccctc 
acaagagatg 
gatatgaggg 
ataggagtag 
ggaataggag 
tcagtgacgc 
aatctgctga 
aagcagctcc 
ggaatttggg 
tggagtaata 
gaaattgaca 
gaaaagaatg 
gacataacaa 



gagggggcat 
gggtcacaat 
catcagatgc 
tacccacaga 
tgtggaaaaa 
gcctaaagcc 
atatcactaa 
aaataaaaaa 
atgcactttt 
ggttaataag 
caattcccat 
cattcaatgg 
ggccagtggt 
ttagatctga 
tagtaattaa 
ggagagcatt 
ttagtagagc 
ttaggaataa 
acagttttaa 
cttggaatgt 
gcagaataaa 
ccatcacagg 
gaggtaatag 
acaattggag 
cacccaccag 
ctgtgttcct 
tgacggtaca 
gggctattga 
aggcaagagt 
gttgctctgg 
aatctgtgga 
attacacaga 
aaaaagaatt 
actggctgtg 



cttgctcctt 
ctattatggg 
taaagcctat 
ccccaaccca 
taacatggta 
atgtgtaaaa 
gaatactact 
ttgctctttc 
taatagactt 
ttgtaacacc 
acattattgt 
atcaggacca 
gtcaactcaa 
aaatttcaca 
ttgtacaaga 
ttatgcaaga 
aaaatggaat 
aacaatagcc 
ttgtggaggg 
tactggaggg 
acaaattata 
acaaattaga 
tactgagact 
aagtgaatta 
ggcaaagaga 
tgggttcttg 
ggccaggcta 
ggcgcaacag 
cctggctctg 
aaaactcatt 
tgatatttgg 
ctatatatat 
attggaattg 
gtatataaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2051 
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<210> 
<211> 
<212> 
<213> 



4 

667 
PRT 

Human immunodeficiency virus type 1 



<220> 



<223> human immunodeficiency virus-1 (HIV-1) envelope 
glycoprotein gpl40 truncated version of gp 160 
from primary R5X4 isolate 8 9.6 

<400> 4 

Met Arg Val Lys Glu lie Arg Lys Asn Trp Gin His Leu Arg Gly Gly 
1 5 10 15 

lie Leu Leu Leu Gly Met Leu Met lie Cys Ser Ala Ala Lys Glu Lys 
20 25 30 

Thr Trp Val Thr lie Tyr Tyr Gly Val Pro Val Trp Arg Glu Ala Thr 
35 40 45 

Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 
50 55 60 

His Asn Val Trp Ala Thr His Ala Cys Val. Pro Thr Asp Pro Asn Pro 
65 70 75 80 

Gin Glu Val Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys 
85 90 95 

Asn Asn Met Val Asp Gin Met His Glu Asp lie lie Ser Leu Trp Asp 
100 105 110 

Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 
115 120 125 

Asn Cys Thr Asn Leu Asn lie Thr Lys Asn Thr Thr Asn Pro Thr Ser 
130 135 140 

Ser Ser Trp Gly Met Met Glu Lys Gly Glu lie Lys Asn Cys Ser Phe 
145 150 155 160 

Tyr lie Thr Thr Ser lie Arg Asn Lys Val Lys Lys Glu Tyr Ala Leu 
165 170 175 

Phe Asn Arg Leu Asp Val Val Pro lie Glu Asn Thr Asn Asn Thr Lys 
180 185 190 

Tyr Arg Leu lie Ser Cys Asn Thr Ser Val lie Thr Gin Ala Cys Pro 
195 200 205 

Lys Val Ser Phe Gin Pro lie Pro He His Tyr Cys Val Pro Ala Gly 
210 215 220 

Phe Ala Met Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Ser Gly Pro 
225 230 235 240 

Cys Thr Asn Val Ser Thr Val Gin Cys Thr His Gly He Arg Pro Val 
245 250 255 

Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Asp He 



260 



265 



270 
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Val lie Arg Ser 
275 

Gin Leu Asn Glu 
290 

Thr Arg Arg Arg 
305 

Arg Asn lie lie 



Ala Lys Trp Asn 
340 

Lys Phe Arg Asn 
355 

Pro Glu He Val 
370 

Cys Asn Thr Ala 
385 

Thr Asn Gly Thr 



Lys Gin He He 
420 

Pro Pro He Thr 
435 

Leu Leu Thr Arg 
450 

Phe Arg Pro Gly 
465 

Tyr Lys Tyr Lys 



Arg Ala Lys Arg 
500 

Gly Ala Val Phe 
515 

Ala Ala Ser Val 
530 

He Val Gin Gin 
545 

His Met Leu Gin 



Val Leu Ala Leu 
580 




Glu Asn Phe Thr 
280 

Ser Val Val He 
295 

Leu Ser He Gly 
310 

Gly Asp He Arg 
325 

Asn Thr Leu Gin 



Lys Thr He Ala 
360 

Met His Ser Phe 
375 

Gin Leu Phe Asn 
390 

Glu Gly Asn Asp 
405 

Asn Met Trp Gin 



Gly Gin He Arg 
440 

Asp Gly Gly Asn 
455 

Gly Gly Asp Met 
470 

Val Val Arg He 
485 

Arg Thr Val Gin 



Leu Gly Phe Leu 
520 

Thr Leu Thr Val 
535 

Gin Asn Asn Leu 
550 

Leu Thr Val Trp 
565 

Glu Arg Tyr Leu 



Asp Asn Ala Lys 



Asn Cys Thr Arg 
300 

Pro Gly Arg Ala 
315 

Gin Ala His Cys 
330 

Gin He Val He 
345 

Phe Asn Gin Ser 



Asn Cys Gly Gly 
380 

Ser Thr Trp Asn 
395 

He He Thr Leu 
410 

Lys Val Gly Lys 
425 

Cys Ser Ser. Asn 



Ser Thr Glu Thr 
460 

Arg Asp Asn Trp 
475 

Glu Pro He Gly 
490 

Arg Glu Lys Arg 
505 

Gly Ala Ala Gly 



Gin Ala Arg Leu 
540 

Leu Arg Ala He 
555 

Gly He Lys Gin 
570 

Arg Asp Gin Gin 
585 




Thr He He Val 
285 

Pro Asn Asn Asn 



Phe Tyr Ala Arg 
320 

Asn He Ser Arg 
335 

Lys Leu Arg Glu 
350 

Ser Gly Gly Asp 
365 

Glu Phe Phe Tyr 



Val Thr Gly Gly 
400 

Gin Cys Arg He 
415 

Ala Met Tyr Ala 
430 

He Thr Gly Leu 
445 

Glu Thr Glu He 



Arg Ser Glu Leu 
480 

Val Ala Pro Thr 
495 

Ala Val Gly He 
510 

Ser Thr Met Gly 
525 

Leu Leu Ser Gly 



Glu Ala Gin Gin 
560 

Leu Gin Ala Arg 
575 

Leu Met Gly He 
590 
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Trp Gly Cys Ser Gly Lys Leu lie 
595 600 

Val Ser Trp Ser Asn Lys Ser Val 
610 615 

Trp Met Glu Trp Glu Arg Glu lie 
625 630 

Asp Leu Leu Glu Lys Ser Gin Thr 
645 

Leu Leu Glu Leu Asp Lys Trp Ala 
660 



Cys Thr Thr Ser Val Pro Trp Asn 
605 

Asp Asp lie Trp Asn Asn Met Thr 
620 

Asp Asn Tyr Thr Asp Tyr lie Tyr 
635 640 

Gin Gin Glu Lys Asn Glu Lys Glu 
650 655 

Ser Leu Trp 
665 



<210> 5 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cleavage site 
and start of gp41 in gpl60 (env 89.6) 

<400> 5 

caaagagaaa aaagagcagt gggaata 



<210> 6 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : cleavage site 
and start of gp41 in gpl60 (env 89.6) 

<400> 6 

Arg Glu Lys Arg Ala Val Gly lie 
1 5 



<210> 7 

<211> 667 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : gpl40 -26 
(soluble, secreted protein) 



<400> 7 

Lys Glu Lys Thr Trp Val Thr lie 
1 5 

Glu Ala Thr Thr Thr Leu Phe Cys 
20 

Thr Glu Val His Asn Gly Trp Ala 
35 40 



Tyr Tyr Gly Val Pro Val Trp Arg 
10 15 

Ala Ser Asp Ala Lys Ala Tyr Asp 
25 30 

Thr His Ala Cys Val Ala Thr Asp 
45 
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Pro Asn Pro Gin 
50 

Met Trp Lys Asn 
65 

Leu Leu Asp Glu 



Val Thr Leu Asn 
100 

Pro Thr Ser Ser 
115 

Cys Ser Phe Tyr 
130 

Tyr Ala Leu Phe 
145 

Asn Thr Lys Tyr 



Ala Cys Pro Lys 
180 

Pro Ala Gly Phe 
195 

Ser Gly Pro Cys 
210 

Arg Pro Val Val 
225 

Glu Asp lie Val 



lie lie Val Gin 
260 

Asn Asn Asn Thr 
275 

Tyr Ala Arg Arg 
290 

lie Ser Arg Ala 
305 

Leu Arg Glu Lys 



Gly Gly Asp Pro 
340 

Phe Phe Tyr Cys 
355 



Glu Val Val Leu 
55 

Asn Met Val Asp 
70 

Ser Leu Lys Pro 
85 

Cys Thr Asn Leu 



Ser Leu Gly Met 
120 

lie Thr Thr Ser 
135 

Asn Arg Leu Asp 
150 

Arg Leu lie Ser 
165 

Val Phe Phe Gin 



Ala Met Leu Lys 
200 

Thr Asn Val Ser 
215 

Ser Thr Gin Leu 
230 

lie Arg Ser Gly 
245 

Leu Asn Glu Ser 



Arg Arg Arg Leu 
280 

Asn lie lie Gly 
295 

Lys Leu Asn Asn 
310 

Phe Arg Asn Lys 
325 

Glu lie Val Met 



Asn Thr Ala Gin 
360 



Gly Asn Val Thr 
60 

Gin Met His Glu 
75 

Cys Val Lys Leu 
90 

Asn lie Thr Lys 
105 

Met Glu Lys Gly 



lie Arg Asn Lys 
140 

Val Val Pro He 
155 

Cys Asn Thr Ser 
170 

Pro He Ala He 
185 

Cys Asn Asn Lys 



Thr Val Pro Cys 
220 

Leu Leu Asn Gly 
235 

Asn Phe Thr Asp 
250 

Val Val He Asn 
265 

Ser He Gly Pro 



Asp He Arg Gin 
300 

Thr Leu Gin Gin 
315 

Thr He Ala Phe 
330 

His Ser Phe Asn 
345 

Leu Phe Asn Ser 



Glu Asn Phe Asn 



Asp He He Ser 
80 

Thr Pro Leu Cys 
95 

Asn Thr Thr Asn 
110 

Glu He Lys Asn 
125 

Val Lys Lys Glu 



Glu Asn Thr Asn 
160 

Val He Thr Gin 
175 

His Tyr Cys Val 
190 

Thr Phe Asn Gly 
205 

Thr His Gly He 



Ser Leu Ala Glu 
240 

Asn Ala Lys Thr 
255 

Cys Thr Arg Pro 
270 

Gly Arg Ala Phe 
285 

Ala His Cys Asn 



He Val He Lys 
320 

Asn Gin Ser Ser 
335 

Cys Gly Gly Glu 
350 

Thr Leu Asn Val 
365 
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Thr Gly Gly Thr Asn Gly Thr Glu Glu Asn Asp lie lie Thr Leu Gin 
370 375 380 

Cys Arg lie Lys Gin lie lie Asn Met Trp Gin Lys Val Gly Lys Ala 
385 390 395 400 

Met Tyr Ala Pro Pro lie Thr Gly Gin lie lie Cys Ser Ser Asn lie 
405 410 415 

Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Ser Thr Glu Thr Glu 
420 425 430 

Thr Glu lie Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg 
435 440 445 

Ser Glu Leu Tyr Lys Tyr Lys Val Val Arg lie Glu Pro lie Gly Val 
450 455 460 

Ala Pro Thr Arg Ala Lys Arg Arg Thr Cys Gin Gly Gly lie Asp Gly 
465 470 475 480 

lie Leu Gin lie Ser Gly Ser Gly Ser Gly Gly Ser Gly Gin Gly Ser 
485 490 495 

Ser Ser Gly Gly Ala Gly Gly Lys Gly Ala Val Gly lie Gly Ala Val 
500 505 510 

Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Arg Ser 
515 520 525 

Val Thr Leu Thr Val Gin Ala Arg Leu Leu Leu Ser Gly lie Val Gin 
530 535 540 

Gin Gin Asn Asn Leu Leu Arg Ala lie Glu Ala Gin Gin His Met Leu 
545 550 555 560 

Gin Leu Thr Val Trp Gly lie Lys Gin Leu Gin Ala Arg Val Leu Ala 
565 570 575 

Leu Glu Arg Tyr Leu Arg Asp Gin Gin Leu Met Gly lie Trp Gly Cys 
580 585 590 

Ser Gly Lys Leu lie Cys Thr Thr Ser Val Pro Trp Asn Val Ser Trp 
595 600 605 

Ser Asn Lys Ser Val Asp Asp lie Trp Asn Asn Met Thr Trp Met Glu 
610 615 620 

Leu Glu Arg Glu lie Asp Asn Tyr Thr Asp Tyr lie Tyr Asp Leu Leu 
625 630 635 640 

Glu Lys Ser Gin Thr Gin Gin Glu Lys Asn Glu Lys Glu Leu Leu Glu 
645 650 655 

Leu Asp Lys Trp Ala Ser Leu Trp Lys Leu Val 
660 665 
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<210> 8 
<211> 656 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : gpl4 0 - 15 
(soluble, secreted protein) 

<400> 8 

Lys Glu Lys Thr Trp Val Thr lie Tyr Tyr Gly Val Pro Val Trp Arg 
15 10 15 

Glu Ala Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp 
20 25 30 

Thr Glu Val His Asn Gly Trp Ala Thr His Ala Cys Val Ala Thr Asp 
35 40 45 

Pro Asn Pro Gin Glu Val Val Leu Gly Asn Val Thr Glu Asn Phe Asn 
50 55 60 

Met Trp Lys Asn Asn Met Val Asp Gin Met His Glu Asp lie lie Ser 
65 70 75 80 

Leu Leu Asp Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys 
85 90 95 

Val Thr Leu Asn Cys Thr Asn Leu Asn lie Thr Lys Asn Thr Thr Asn 
100 105 110 

Pro Thr Ser Ser Ser Leu Gly Met Met Glu Lys Gly Glu lie Lys Asn 
115 120 125 

Cys Ser Phe Tyr lie Thr Thr Ser lie Arg Asn Lys Val Lys Lys Glu 
13 0 13 5 14 0 

Tyr Ala Leu Phe Asn Arg Leu Asp Val Val Pro lie Glu Asn Thr Asn 
145 150 155 160 

Asn Thr Lys Tyr Arg Leu lie Ser Cys Asn Thr Ser Val lie Thr Gin 
165 170 175 

Ala Cys Pro Lys Val Phe Phe Gin Pro lie Ala lie His Tyr Cys Val 
180 185 190 

Pro Ala Gly Phe Ala Met Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly 
195 200 205 

Ser Gly Pro Cys Thr Asn Val Ser Thr Val Pro Cys Thr His Gly lie 
210 215 220 

Arg Pro Val Val Ser Thr Gin Leu Leu Leu Asn Gly Ser Leu Ala Glu 
225 230 235 240 

Glu Asp lie Val lie Arg Ser Gly Asn Phe Thr Asp Asn Ala Lys Thr 
245 250 255 

lie lie Val Gin Leu Asn Glu Ser Val Val lie Asn Cys Thr Arg Pro 
260 265 270 
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Asn Asn Asn Thr 
275 

Tyr Ala Arg Arg 
290 

lie Ser Arg Ala 
305 

Leu Arg Glu Lys 



Gly Gly Asp Pro 
340 

Phe Phe Tyr Cys 
355 

Thr Gly Gly Thr 
370 

Cys Arg lie Lys 
385 

Met Tyr Ala Pro 



Thr Gly Leu Leu 
420 

Thr Glu lie Phe 
435 

Ser Glu Leu Tyr 
450 

Ala Pro Thr Arg 
465 

lie Leu Gin lie 



Gly lie Gly Ala 
500 

Met Gly Ala Arg 
515 

Ser Gly lie Val 
530 

Gin Gin His Met 
545 

Ala Arg Val Leu 



Gly lie Trp Gly 
580 



Arg Arg Arg Leu 
280 

Asn lie lie Gly 
295 

Lys Leu Asn Asn 
310 

Phe Arg Asn Lys 
325 

Glu lie Val Met 



Asn Thr Ala Gin 
360 

Asn Gly Thr Glu 
375 

Gin lie lie Asn 
390 

Pro lie Thr Gly 
405 

Leu Thr Arg Asp 



Arg Pro Gly Gly 
440 

Lys Tyr Lys Val 
455 

Ala Lys Arg Arg 
470 

Ser Ser Ser Gly 
485 

Val Phe Leu Gly 



Ser Val Thr Leu 
520 

Gin Gin Gin Asn 
535 

Leu Gin Leu Thr 
550 

Ala Leu Glu Arg 
565 

Cys Ser Gly Lys 



Ser lie Gly Pro 



Asp lie Arg Gin 
300 

Thr Leu Gin Gin 
315 

Thr lie Ala Phe 
330 

His Ser Phe Asn 
345 

Leu Phe Asn Ser 



Glu Asn Asp lie 
380 

Met Trp Gin Lys 
395 

Gin lie lie Cys 
410 

Gly Gly Asn Ser 
425 

Gly Asp Met Arg 



Val Arg lie Glu 
460 

Thr Cys Gin Gly 
475 

Gly Ala Gly Gly 
490 

Phe Leu Gly Ala 
505 

Thr Val Gin Ala 



Asn Leu Leu Arg 
540 

Val Trp Gly lie 
555 

Tyr Leu Arg Asp 
570 

Leu lie Cys Thr 
585 



Gly Arg Ala Phe 
285 

Ala His Cys Asn 



He Val He Lys 
320 

Asn Gin Ser Ser 
335 

Cys Gly Gly Glu 
350 

Thr Leu Asn Val 
365 

He Thr Leu Gin 



Val Gly Lys Ala 
400 

Ser Ser Asn He 
415 

Thr Glu Thr Glu 
430 

Asp Asn Trp Arg 
445 

Pro He Gly Val 



Gly He Asp Gly 
480 

Lys Gly Ala Val 
495 

Ala Gly Ser Thr 
510 

Arg Leu Leu Leu 
525 

Ala He Glu Ala 



Lys Gin Leu Gin 
560 

Gin Gin Leu Met 
575 

Thr Ser Val Pro 
590 
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+ 



Trp Asn Val Ser 
595 

Met Thr Trp Met 
610 

lie Tyr Asp Leu 
625 

Lys Glu Leu Leu 



Trp Ser Asn Lys 
600 

Glu Leu Glu Arg 
615 

Leu Glu Lys Ser 
630 

Glu Leu Asp Lys 
645 



Ser Val Asp Asp 



Glu lie Asp Asn 
620 

Gin Thr Gin Gin 
635 

Trp Ala Ser Leu 
650 



lie Trp Asn Asn 
605 

Tyr Thr Asp Tyr 



Glu Lys Asn Glu 
640 

Trp Lys Leu Val 
655 



<210> 9 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : peptide linker 
<400> 9 

Gly lie Leu lie 
1 



<210> 10 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 10 

Gly Gly lie Asp Gly lie Leu Gin lie Ser Ser Ser Gly Gly Ala 
15 10 15 



<210> 11 
<211> 26 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 11 

Gly Gly lie Asp Gly He Leu Gin He Ser Gly Ser Gly Ser Gly Gly 
15 10 15 

Ser Gly Gin Gly Ser Ser Ser Gly Gly Ala 
20 25 



<210> 12 

<211> 15 

<212> PRT 

<213> Artificial 



Sequence 
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<220> 

<223> Description of Artificial Sequence :pref erred 
peptide linker 

<400> 12 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
15 10 15 



<210> 13 
<211> 20 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 13 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
15 10 15 

Gly Ser Gly Gly 
20 



<210> 14 
<211> 25 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : preferred 
peptide linker 

<400> 14 

Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
15 10 15 

Gly Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 



<210> 15 
<211> 24 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> M0D_RES 
<222> (7) . . (24) 

<223> amino acids at positions 7-8, 9-10, 11-12, 13-14, 
15-16, 17-18, 19-20, 21-22, and 23-24 may be 
present or absent 
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# 



<400> 15 

Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser Gly Ser 
1 5 10 15 

Gly Ser Gly Ser Gly Ser Gly Ser 
20 



<210> 16 
<211> 48 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> M0D_RES 
<222> (13) . . (48) 

<223> amino acids at positions 13-16, 17-21, 22-24, 

25-28, 29-32, 33-36, 37-40, 41-44, and 45-48 may 
be present or absent 

<400> 16 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
1 5 10 15 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
35 40 45 



<210> 17 
<211> 60 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> M0D_RES 
<222> (16) . . (60) 

<223> amino acids at positions 16-20, 21-25, 26-30, 

31-35, 36-40, 41-45, 46-50, 51-55 and 55-60 may be 
present or absent 

<400> 17 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1 5 10 15 

Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 
20 25 30 
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Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser 
50 55 60 



<210> 18 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<220> 

<221> MOD_RES 
<222> (19) . . (72) 

<223> amino acids at positions 19-24, 25-30, 31-36, 

37-42, 43-48, 49-54, 55-60, 61-66 and 67-72 may be 
present or absent 

<400> 18 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
1 5 10 15 

Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser 
35 40 45 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
50 55 60 

Gly Ser Gly Gly Gly Gly Gly Ser 
65 70 



<210> 19 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 19 

Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser 
15 10 



<210> 20 

<211> 20 

<212> PRT 

<213> Artificial 



Sequence 
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# 



<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 20 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 
1 5 10 15 

Gly Gly Gly Ser 
20 



<210> 21 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 21 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
15 10 15 

Gly Ser Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Ser 
20 25 30 



<210> 22 
<211> 42 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 22 

Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Ser Gly Gly 
1 5 10 15 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
20 25 30 

Gly Gly Ser Gly Gly Gly Gly Gly Gly Ser 
35 40 



<210> 23 
<211> 56 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 23 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Ser 
1 " A 5 10 15 
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Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Ser 
20 25 30 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Ser 
35 40 45 

Gly Gly Gly Gly Gly Gly Gly Ser 
50 55 



<210> 24 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 24 

Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly 
15 10 15 

Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly 
35 40 45 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly 
50 55 60 

Gly Gly Gly Gly Gly Gly Gly Ser 
65 70 



<210> 25 
<211> 90 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 25 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly 
15 10 15 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly 
20 25 30 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
3 5 4 0 4 5 

Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
50 55 60 
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Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
65 70 75 80 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
85 90 



<210> 26 
<211> 110 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 26 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly 
15 10 15 

Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
20 25 30 

Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
35 40 45 

Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly 
50 55 60 

Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly 
65 70 75 80 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
85 90 95 

Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
100 105 110 



<210> 27 
<211> 132 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 27 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
1 5 10 15 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
20 25 30 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
35 40 45 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
50 55 60 
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Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
65 70 75 80 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
85 90 95 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly 
100 105 110 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
115 120 125 

Gly Gly Gly Ser 
130 



<210> 28 
<211> 156 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : exemplary 
linker 

<400> 28 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly 
1 5 10 15 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly 
20 25 30 

Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly 
35 40 45 

Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
50 55 60 

Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly 
65 70 75 80 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly 
" 85 90 95 

Gly Gly Gly Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly 
100 105 110 

Gly Gly Gly Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
115 12 0 12 5 

Gly Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser Gly 
13 0 13 5 14 0 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Ser 
145 150 155 



<210> 29 
<211> 36 
<212> PRT 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence : exemplary- 
linker 

<220> 

<221> MOD_RES 
<222> (10) . . (36) 

<223> amino acids at positions 10-12, 13-15, 16-18, 

19-21, 22-24, 25-27, 28-30, 31-33 and 34-36 may be 
present or absent 

<400> 29 

Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala 
15 10 15 

Gly Ser Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala Gly Ser Ala Gly 
20 25 30 

Ser Ala Gly Ser 
35 



<210> 30 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : post 
translational cleavage site 

<400> 30 
Arg Glu Lys Arg 
1 



<210> 31 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : mutated post 
translational cleavage site 

<400> 31 
Arg Glu lie Asp 
1 



<210> 32 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : short fragment 
following mutated cleavage site created by 
introduction of restriction sites 
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<400> 32 
Glu Phe lie Ser 
1 



<210> 33 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : polypeptide 
linker end 

<400> 33 

Gly Gly Ser Gly Gly 
1 " 5 
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